• SEQUENCE LISTING 

(1) INFORMATION FOR SEQ ID NO: 1: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 6 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: internal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

Asn Asp Gin Val Leu Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 6 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: internal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Phe Glu Asp Met Thr Asp 
1 5 

(3) INFORMATION FOR SEQ ID NO: 3: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 7 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: internal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Phe Lys Leu lie Leu Lys Lys 
1 5 

(4) INFORMATION FOR SEQ ID NO: 4: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 5 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: internal fragment 



- 40 - 



( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Tyr Lys Asp Ser 
1 5 

(5) INFORMATION FOR SEQ ID NO: 5: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 5 amino acids 
( B ) TYPE : amino acid 
( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: internal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Thr Leu Ser Cys 
1 5 

(6) INFORMATION FOR SEQ ID NO: 6: 



( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 157 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 



(ii)MOLECULE TYPE: peptide 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Tyr 


Phe 


Gly 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


He 


Arg 


Asn 


Leu 


Asn 


1 






5 










10 










15 




Asp 


Gin 


Val 


Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 


Arg 


Pro 


Leu 


Phe 


Glu 


Asp 






20 










25 










30 






Met 


Thr 


Asp 

35 


Ser 


Asp 


Cys 


Arg 


Asp 
40 


Asn 


Ala 


Pro 


Arg 


Thr 
45 


He 


Phe 


He 


He 


Ser 
50 


Met 


Tyr 


Lys 


Asp 


Ser 
55 


Gin 


Pro 


Arg 


Gly 


Met 
60 


Ala 


Val 


Thr 


He 


Ser 


Val 


Lys 


Cys 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Cys 


Glu 


Asn 


Lys 


He 


65 






70 










75 










80 


He 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


Asn 


He 


Lys 


Asp 


Thr 


Lys 








85 










90 










95 




Ser 


Asp 


He 


He 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


Asp 


Asn 


Lys 






100 










105 










110 






Met 


Gin 


Phe 
115 


Glu 


Ser 


Ser 


Ser 


Tyr 
120 


Glu 


Gly 


Tyr 


Phe 


Leu 
125 


Ala 


Cys 


Glu 


Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 


130 










135 










140 










Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 









145 150 155 



(7) INFORMATION FOR SEQ ID NO: 7: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 157 amino acids 
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( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii)MOLECULE TYPE: peptide 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Asn 


rile 




Arg 


Leu 


HIS 


Lys 


i nr 


r"P -r~ 
1 III 


A "1 -=j 
/A _L CI 


V a _L 


11c 


Arg 


Asn 


J- -L tt 


Asn 


l 








D 










1 u 










1 D 




Asp 


P 1 

bin 


vai 


Leu 


rile 


vax 


Asp 


Lys 


Arg 




ir ro 


Val 


riie 


pi 

LjIU 


Asp 




































i nr 


Asp 


lie 


Asp 


\j in 


Ser 


Ala 




L?1U 


T~) -»~ /-~\ 

rlO 


pi n 

bin 


i nr 


Arg 


Leu 


lie 


lie 
















40 










45 








Tyr 


Met 


Tyr 


Lys 


Asp 


Ser 


Glu 


Val 


Arg 


Gly 


Leu 


Ala 


Val 


Thr 


Leu 


Ser 




50 










55 










60 










Val 


Lys 


Asp 


Ser 


Lys 


Met 


Ser 


Thr 


Leu 


Ser 


Cys 


Lys 


Asn 


Lys 


lie 


lie 


65 










70 










75 










80 


Ser 


Phe 


Glu 


Glu 


Met 


Asp 


Pro 


Pro 


Glu 


Asn 


He 


Asp 


Asp 


lie 


Gin 


Ser 










85 










90 










95 




Asp 


Leu 


lie 


Phe 


Phe 


Gin 


Lys 


Arg 


Val 


Pro 


Gly 


His 


Asn 


Lys 


Met 


Glu 








100 










105 










110 






Phe 


Glu 


Ser 


Ser 


Leu 


Tyr 


Glu 


Gly 


His 


Phe 


Leu 


Ala 


Cys 


Gin 


Lys 


Glu 






115 










120 










125 








Asp 


Asp 


Ala 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Lys 


Asp 


Glu 


Asn 


Gly 


Asp 




130 










135 










140 










Lys 


Ser 


Val 


Met 


Phe 


Thr 


Leu 


Thr 


Asn 


Leu 


His 


Gin 


Ser 
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( 8 ) INFORMATION FOR SEQ ID NO: 8: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 471 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

( vi ) ORIGINAL SOURCE: 

( A) ORGANISM: human 
( G ) CELL TYPE: liver 

( ix) FEATURE: 

( A ) NAME / KEY : mat peptide 

(B) LOCATION: 1. .471 

(C) IDENTIFICATION METHOD: E 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TAC TTT GGC AAG CTT GAA TCT AAA TTA TCA GTC ATA AG A A AT TTG AAT 48 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val lie Arg Asn Leu Asn 

15 10 15 

GAC CAA GTT CTC TTC ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT 9 6 

Asp Gin Val Leu Phe lie Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp 

20 25 30 

ATG ACT GAT TCT GAC TGT AGA GAT AAT GCA CCC CGG ACC ATA TTT ATT 144 
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Mot- 
Pit; L. 


Thr 


A C2 T~\ 




A c: n 




A rn 


/AO JkJ 

40 


Asn 


Ala 


Pro 


Arn 


Thr 
45 


He 


Phe 


He 




ATA 




ATG 


TAT 


AAA 


GAT 


AGC 


CAG 


CCT 


AGA 


GGT 


ATG 


GCT 


GTA 


ACT 


ATC 


192 


lie 


Ser 
50 


Met: 


Tyr 


Lys 


Asp 


Ser 
55 


Gin 


Pro 


Arg 


Gly 


Met 
60 


Ala 


Val 


Thr 


He 




TCT 


GTG 


A AG 


TGT 


GAG 


AAA 


ATT 


TCA 


ACT 


CTC 


TCC 


TGT 


GAG 


AAC 


AAA 


ATT 


240 


Ser 


Val 


Lys 


Cys 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Cys 


Glu 


Asn 


Lys 


He 




65 










70 










75 










80 




ATT 


TCC 


TTT 


AAG 


GAA 


ATG 


AAT 


CCT 


CCT 


GAT 


AAC 


ATC 


AAG 


GAT 


ACA 


AAA 


288 


He 


Ser 


Phe 


Lys 


Glu 
85 


Met 


Asn 


Pro 


Pro 


Asp 
90 


Asn 


He 


Lys 


Asp 


Thr 
95 


Lys 




AGT 


GAC 


ATC 


ATA 


TTC 


TTT 


CAG 


AGA 


AGT 


GTC 


CCA 


GGA 


CAT 


GAT 


AAT 


AAG 


336 


Ser 


Asp 


lie 


He 
100 

X w W 


Phe 


Phe 


Gin 


Arcr 


Ser 
105 


Val 


Pro 


Glv 

vj j_ jr 


His 


Asp 
110 


Asn 


Lys 




ATP 


CAA 


TTT 


r;AA 




TPA 


TPA 


TAC 


GAA 


GGA 


TAC 


TTT 


CTA 


GCT 


TGT 


GAA 


384 


Met 


Gin 


Phe 
115 


Glu 


Ser 


Ser 


Ser 


Tyr 
120 


Glu 


Gly 


Tyr 


Phe 


Leu 
125 


Ala 


Cys 


Glu 




AAA 


GAG 


AGA 


GAC 


CTT 


TTT 


AAA 


CTC 


ATT 


TTG 


AAA 


AAA 


GAG 


GAT 


GAA 


TTG 


432 


Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


lie 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 






130 








135 










140 












GGG 


GAT 


AGA 


TCT 


ATA 


ATG 


TTC 


ACT 


GTT 


CAA 


AAC 


GAA 


GAC 








471 


Gly 


Asp 


Arg 


Ser 


lie 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 











145 150 155 

(9) INFORMATION FOR SEQ ID NO: 9: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 11 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

( ii ) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: N- terminal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser 
15 10 

(10) INFORMATION FOR SEQ ID NO: 10: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 10 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: C- terminal fragment 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser He Met Phe Thr Val Gin Asn Glu Asp 
15 10 
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(11) INFORMATION FOR SEQ ID NO: 11: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 13 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: N- terminal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val lie Arg 
15 10 

(12) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 14 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: internal fragment 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Thr lie Phe lie lie Ser Met Tyr Lys Asp Ser Gin Pro Arg 
15 10 

(13) INFORMATION FOR SEQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 17 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii)MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: internal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

lie lie Ser Phe Lys Glu Met Asn Pro Pro Asp Asn lie Lys Asp Thr Lys 
1 5 10 15 

( 14 ) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 471 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY: linear 
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(ii)MOLECULE TYPE: cDNA 

( ix) FEATURE: 

( A ) NAME/KEY : mat peptide 

(B) LOCATION: 1. .471 

(C) IDENTIFICATION METHOD: S 



( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



TAP 


TTT 

L L J. 


PPP 


AAG 


CTT 


GAA 


TCT 


AAA 


TTA 


TCA 


GTC 


ATA 


AGA 


AAT 


TTG 


AAT 


48 


lyr 


php 
tr i lti 


PI \7 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


He 


Arcr 


Asn 


Leu 


Asn 




i 
j_ 




5 










10 










15 






pap 


PA A 


PTT 

Oil 


PTP 


TTC 


ATT 


GAC 


CAA 


GGA 


AAT 


CGG 


CCT 


CTA 


TTT 


GAA 


GAT 


96 


jt\ CD 


Gin 


Val 


Leu. 


Phe 


lie 


Asp 


Gin 


Gly 

v-/ -i_ jr 


Asn 


Arq 


Pro 


Leu 


Phe 


Glu 


Asp 


















25 










30 








ATP 


APT 


GAT 


TCT 


GAC 


TCT 


AGA 


GAT 


AAT 


GCA 


ccc 


CGG 


ACC 


ATA 


TTT 


ATT 


144 


Met 


Thr 


Asp 


Ser 


Asp 


Ser 


Arg 


Asp 
40 


Asn 


Ala 


Pro 


Arg 


Thr 
45 


He 


Phe 


He 




ATA 


APT 


ATG 


TAT 


AAA 


GAT 


AGC 


CAG 


CCT 


AGA 


GGT 


ATG 


GCT 


GTA 


ACT 


ATC 


192 


lie 


Ser 
50 


Met 


Tvr 


Lys 


Asd 


Ser 
55 


Gin 


Pro 


Arg 


Gly 


Met 
60 


Ala 


Val 


Thr 


He 




TCT 


GTG 


AAG 


TCT 


GAG 


AAA 


ATT 


TCA 


ACT 


CTC 


TCC 


GCT 


GAG 


AAC 


AAA 


ATT 


240 


Ser 


Val 


Lys 


Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Ala 


Glu 


Asn 


Lys 


He 




65 








70 










75 










80 




ATT 


TCC 


TTT 


AAG 


GAA 


ATG 


A AT 


CCT 


CCT 


GAT 


AAC 


ATC 


AAG 


GAT 


ACA 


AAA 


288 


He 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


Asn 


He 


Lys 


Asp 


Thr 


Lys 










85 










90 










95 






AGT 


GAC 


ATC 


ATA 


TTC 


TTT 


CAG 


AGA 


AGT 


GTC 


CCA 


GGA 


CAT 


GAT 


AAT 


AAG 


336 


Ser 


Asp 


He 


lie 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


Asp 


Asn 


Lys 








100 








105 










110 








ATG 


CAA 


TTT 


GAA 


TCT 


TCA 


TCA 


TAC 


GAA 


GGA 


TAC 


TTT 


CTA 


GCT 


TGT 


GAA 


384 


Met 


Gin 


Phe 
115 


Glu 


Ser 


Ser 


Ser 


Tyr 
120 


Glu 


Gly 


Tyr 


Phe 


Leu 
125 


Ala 


Cys 


Glu 




AAA 


GAG 


AGA 


GAC 


CTT 


TTT 


AAA 


CTC 


ATT 


TTG 


AAA 


AAA 


GAG 


GAT 


GAA 


TTG 


432 


Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 




130 








135 










140 










471 


GGG 


GAT 


AGA 


TCT 


ATA 


ATG 


TTC 


ACT 


GTT 


CAA 


AAC 


GAA 


GAC 








Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 










145 






150 










155 















(15) INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 10 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii)MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: N- terminal fragment 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser 
15 10 
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(16) INFORMATION FOR SEQ ID NO : 16: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 471 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

( A ) NAME/KEY : mat peptide 

(B) LOCATION: 1. .471 

(C) IDENTIFICATION METHOD: S 



( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



1 


TTT 
1 ± 1 




nap 


PTT 


PA A 


TPT 

J. X 


AAA 


TTA 


TCA 


GTC 


ATA 


AGA 


AAT 


TTG 


AAT 


48 


i yr 


XT 1 


VJ7 _L_ Y 




Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


lie 


Ara 


Asn 


Leu 


Asn 




1 






5 










10 










15 






par 


PA A 


PTT 


PTP 


TTP 

J. J. \~s 


ATT 
n x x 


GAP 


CAA 


GGA 


AAT 


CGG 


CCT 


CTA 


TTT 


GAA 


GAT 


96 


Aon 


KJ JL 1 1 


v a jl 


JLj LI 


Phe 


Tip 

lie 


Asp 


Gin 


Gly 


Asn 


Arn 


Pro 


Leu 


Phe 


Glu 


Asp 








20 










25 










30 










APT 


PAT 


TPT 


PAP 


TPT 

J. v X 


APA 


GAT 


AAT 


GCA 


ccc 


CGG 


ACC 


ATA 


TTT 


ATT 


144 


Met 


Thr 


Asp 

35 


Ser 


Asp 


Ser 


Arg 


Asp 
40 


Asn 


Ala 


Pro 


Arg 


Thr 
45 


He 


Phe 


lie 




ATA 


AGT 


ATG 


TAT 


AAA 


GAT 


AGC 


CAG 


CCT 


AGA 


GGT 


ATG 


GCT 


GTA 


ACT 


ATC 


192 


lie 


Ser 
50 


Met 


Tyr 


Lys 


Asp 


Ser 
55 


Gin 


Pro 


Arg 


Gly 


Met 
60 


Ala 


Val 


Thr 


lie 




TCT 


GTG 


A AG 


TCT 


GAG 


AAA 


ATT 


TCA 


ACT 


CTC 


TCC 


GCT 


GAG 


AAC 


AAA 


ATT 


240 


Ser 


Val 


Lys 


Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Ala 


Glu 


Asn 


Lys 


lie 




65 










70 










75 










80 




ATT 


TCC 


TTT 


AAG 


GAA 


ATG 


AAT 


CCT 


CCT 


GAT 


AAC 


ATC 


AAG 


GAT 


ACA 


AAA 


288 


lie 


Ser 


Phe 


Lys 


Glu 
85 


Met 


Asn 


Pro 


Pro 


Asp 
90 


Asn 


lie 


Lys 


Asp 


Thr 
95 


Lys 




AGT 


GAC 


ATC 


ATA 


TTC 


TTT 


CAG 


AGA 


AGT 


GTC 


CCA 


GGA 


CAT 


GAT 


AAT 


AAG 


336 


Ser 


Asp 


lie 


lie 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


Asp 


Asn 


Lys 








100 










105 










110 








ATG 


CAA 


TTT 


GAA 


TCT 


TCA 


TCA 


TAC 


GAA 


GGA 


TAC 


TTT 


CTA 


GCT 


TCT 


GAA 


384 


Met 


Gin 


Phe 
115 


Glu 


Ser 


Ser 


Ser 


Tyr 
120 


Glu 


Gly 


Tyr 


Phe 


Leu 
125 


Ala 


Ser 


Glu 




AAA 


GAG 


AGA 


GAC 


CTT 


TTT 


AAA 


CTC 


ATT 


TTG 


AAA 


AAA 


GAG 


GAT 


GAA 


TTG 


432 


Lys 


Glu 
130 


Arg 


Asp 


Leu 


Phe 


Lys 
135 


Leu 


lie 


Leu 


Lys 


Lys 
140 


Glu 


Asp 


Glu 


Leu 




GGG 


GAT 


AGA 


TCT 


ATA 


ATG 


TTC 


ACT 


GTT 


CAA 


AAC 


GAA 


GAC 








471 


Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 










145 










150 










155 















( 17 ) INFORMATION FOR SEQ ID NO: 17: 

( i ) SEQUENCE CHARACTERISTICS : 

( A ) LENGTH : 11464 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 
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(ii)MOLECULE TYPE: genomic DNA 



(vi) ORIGINAL SOURCE: 

A ) ORGANISM: human 
' G ) CELL TYPE: placenta 

( ix ) FEATURE : 

A ) NAME /KEY : 5' UTR 

B) LOCATION: 1. .3 

C ) IDENTIFICATION METHOD: E 

A ) NAME /KEY : leader peptide 

B) LOCATION: 4. .82 

C) IDENTIFICATION METHOD: S 
; A ) NAME/KEY : intron 

B) LOCATION: 83. . 1453 

C) IDENTIFICATION METHOD: E 

A ) NAME / KEY : leader peptide 

B) LOCATION: 1454. .1465 

C) IDENTIFICATION METHOD: S 

A ) NAME/KEY : intron 

B) LOCATION: 1466. .4848 
' C ) IDENTIFICATION METHOD: E 
' A ) NAME / KEY : leader peptide 

B) LOCATION: 4849. .4865 

C ) IDENTIFICATION METHOD: S 
;A)NAME/KEY: mat peptide 

B) LOCATION: 4866. .4983 

C) IDENTIFICATION METHOD: S 

A ) NAME/KEY : intron 

B) LOCATION: 4984. .6317 

C) IDENTIFICATION METHOD: E 

A) NAME/KEY: mat peptide 

B) LOCATION: 6318. .6451 

C) IDENTIFICATION METHOD: S 

A ) NAME/KEY : intron 

B) LOCATION: 6452. . 11224 

C) IDENTIFICATION METHOD: E 

A ) NAME /KEY : mat peptide 

B) LOCATION: 11225. .11443 

C) IDENTIFICATION METHOD: S 

A) NAME/KEY: 3' UTR 

B) LOCATION: 11444. .11464 

C ) IDENTIFICATION METHOD: E 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



AAG ATG GCT GCT GAA CCA GTA GAA GAC A AT TGC 
Met Ala Ala Glu Pro Val Glu Asp Asn Cys 
-35 -30 
ATG AAA TTT ATT GAC A AT ACG CTT TAC TTT ATA 
Met Lys Phe lie Asp Asn Thr Leu Tyr Phe lie 

-20 -15 
AGAACAAATA CCAGGTTCAG ATAAATCTAT TCAATTAGAA 
ATTAAGTGAC TCTTTGTGTC ACCAAATTTC ACTGTAATAT 
GTGGACCTCT AGAAATTAAC CACAACATGT CCAAGGTCTC 
GTCCTGGCAC TTTAATCAGC AGTAGCTCAC TCTCCAGTTG 



ATC AAC TTT GTG GCA 
lie Asn Phe Val Ala 
-25 

G GTAAGG CTAATGCCAT 
Ala 
-10 

AAGATGTTGT GAGGTGAACT 
TAATGGCTCT TAAAAAAATA 
AGCACCTTGT CACACCACGT 
GCAGTAAGTG CACATCATGA 



48 



98 



158 
218 
278 
338 
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AAATCCCAGT TTTCATGGGA AAATCCCAGT TTTCATTGGA TTTCCATGGG AAAAATCCCA 39 8 

GTACAAAACT GGGTGCATTC AGGAAATACA ATTTCCCAAA GCAAATTGGC AAATTATGTA 458 
AGAGATTCTC TAAATTTAGA GTTCCGTGAA TTACACCATT TTATGTAAAT ATGTTTGACA 518 
AGTAAAAATT GATTCTTTTT TTTTTTTTCT GTTGCCCAGG CTGGAGTGCA GTGGCACAAT 57 8 

CTCTGCTCAC TGCAACCTCC ACCTCCTGGG TTCAAGCAAT TCTCCTGCCT CAGCCTTCTG 63 8 
AGTAGCTGGG ACTACAGGTG CATCCCGCCA TGCCTGGCTA ATTTTTGGGT ATTTTTACTA 698 
GAGACAGGGT TTTGGCATGT TGTCCAGGCT GGTCTTGGAC TCCTGATCTC AGATGATCCT 7 58 

CCTGGCTCGG GCTCCCAAAG TGCTGGGATT ACAGGCATGA ACCACCACAC ATGGCCTAAA 818 
AATTGATTCT TATGATTAAT CTCCTGTGAA CAATTTGGCT TCATTTGAAA GTTTGCCTTC 878 
ATTTGAAACC TTCATTTAAA AGCCTGAGCA ACAAAGTGAG ACCCCATCTC TACAAAAAAC 938 
TGCAAAATAT CCTGTGGACA CCTCCTACCT TCTGTGGAGG CTGAAGCAGG AGGATCACTT 99 8 

GAGCCTAGGA ATTTGAGCCT GCAGTGAGCT ATGATCCCAC CCCTACACTC CAGCCTGCAT 1058 

GACAGTAGAC CCTGACACAC ACACACAAAA AAAAACCTTC ATAAAAAATT ATTAGTTGAC 1118 

TTTTCTTAGG TGACTTTCCG TTTAAGCAAT AAATTTAAAA GTAAAATCTC TAATTTTAGA 1178 

AAATTTATTT TTAGTTACAT ATTGAAATTT TTAAACCCTA GGTTTAAGTT TTATGTCTAA 1238 

ATTACCTGAG AACACACTAA GTCTGATAAG CTTCATTTTA TGGGCCTTTT GGATGATTAT 129 8 

ATAATATTCT GATGAAAGCC AAGACAGACC CTTAAACCAT AAAAATAGGA GTTCGAGAAA 13 58 

GAGGAGTAGC AAAAGTAAAA GCTAGAATGA GATTGAATTC TGAGTCGAAA T AC A A A A T T T 1418 

TACATATTCT GTTTCTCTCT TTTTCCCCCT CTTAG CT GAA GAT GAT G GTAAA 1470 

Ala Glu Asp Asp Glu 

-10 

GTAGAAATGA ATTTATTTTT CTTTGCAAAC TAAGTATCTG CTTGAGACAC ATCTATCTCA 1530 

CCATTGTCAG CTGAGGAAAA AAAAAAATGG TTCTCATGCT ACCAATCTGC CTTCAAAGAA 1590 

ATGTGGACTC AGTAGCACAG CTTTGGAATG AAGATGATCA TAAGAGATAC AAAGAAGAAC 16 50 

CTCTAGCAAA AGATGCTTCT CTATGCCTTA AAAAATTCTC CAGCTCTTAG AATCTACAAA 1710 

ATAGACTTTG CCTGTTTCAT TGGTCCTAAG ATTAGCATGA AGCCATGGAT TCTGTTGTAG 1770 

GGGGAGCGTT GCATAGGAAA A AGGGAT TGA AGCATTAGAA TTGTCCAAAA TCAGTAACAC 1830 

CTCCTCTCAG AAATGCTTTG GGAAGAAGCC TGGAAGGTTC CGGGTTGGTG GTGGGGTGGG 1890 

GCAGAAAATT CTGGAAGTAG AGGAGATAGG AATGGGTGGG GCAAGAAGAC CACATTCAGA 19 50 

GGCCAAAAGC TGAAAGAAAC CATGGCATTT ATGATGAATT CAGGGTAATT CAGAATGGAA 2010 

GTAGAGTAGG AGTAGGAGAC TGGTGAGAGG AGCTAGAGTG ATAAACAGGG TGTAGAGCAA 2070 

GACGTTCTCT CACCCCAAGA TGTGAAATTT GGACTTTATC TTGGAGATAA TAGGGTTAAT 2130 

TAAGCACAAT ATGTATTAGC TAGGGTAAAG ATTAGTTTGT TGTAACAAAG ACATCCAAAG 2190 

ATACAGTAGC TGAATAAGAT AGAGAATTTT TCTCTCAAAG AAAGTCTAAG TAGGCAGCTC 2250 

AGAAGTAGTA TGGCTGGAAG CAACCTGATG ATATTGGGAC CCCCAACCTT CTTCAGTCTT 2310 

GTACCCATCA TCCCCTAGTT GTTGATCTCA CTCACATAGT TGAAAATCAT CATACTTCCT 2370 

GGGTTCATAT CCCAGTTATC AAGAAAGGGT CAAGAGAAGT CAGGCTCATT CCTTTCAAAG 2430 

ACTCTAATTG GAAGTTAAAC ACATCAATCC CCCTCATATT CCATTGACTA GAATTTAATC 2490 

ACATGGCCAC ACCAAGTGCA AGGAAATCTG GAAAATATAA TCTTTATTCC AGGTAGCCAT 25 50 

ATGACTCTTT AAAATTCAGA AATAATATAT TTTTAAAATA TCATTCTGGC TTTGGTATAA 2 610 

AGAATTGATG GTGTGGGGTG AGGAGGCCAA AATTAAGGGT TGAGAGCCTA TTATTTTAGT 2 670 

TATTACAAGA AATGATGGTG TCATGAATTA AGGTAGACAT AGGGGAGTGC TGATGAGGAG 2730 

CTGTGAATGG ATTTTAGAAA CACTTGAGAG AATCAATAGG ACATGATTTA GGGTTGGATT 2790 

TGGAAAGGAG AAGAAAGTAG AAAAGATGAT GCCTACATTT TTCACTTAGG CAATTTGTAC 2850 

CATTCAGTGA AATAGGGAAC ACAGGAGGAA GAGCAGGTTT TGGTGTATAC AAAGAGGAGG 2910 

ATGGATGACG CATTTCGTTT TGGATCTGAG ATGTCTGTGG AACGTCCTAG TGGAGATGTC 29 70 

CACAAACTCT TCTACATGTG GTTCTGAGTT CAGGACACAG ATTTGGGCTG GAGATAGAGA 3030 

TATTGTAGGC TTATACATAG AAATGGCATT TGAATCTATA GAGATAAAAA GACACATCAG 3090 

AGGAAATGTG TAAAGTGAGA GAGGAAAAGC CAAGTACTGT GCTGGGGGGA ATACCTACAT 3150 

TTAAAGGATG CAGTAGAAAG AAGCTAATAA ACAACAGAGA GCAGACTAAC CAAAAGGGGA 3210 

GAAGAAAAAC CAAGAGAATT CCACCGACTC CCAGGAGAGC ATTTCAAGAT TGAGGGGATA 3270 

GGTGTTGTGT TGAATTTTGC AGCCTTGAGA ATCAAGGGCC AGAACACAGC TTTTAGATTT 3330 

AGCAACAAGG AGTTTGGTGA TCTCAGTGAA AGCAGCTTGA TGGTGAAATG GAGGCAGAGG 3390 

CAGATTGCAA TGAGTGAAAC AGTGAATGGG AAGTGAAGAA ATGATACAGA TAATTCTTGC 3450 

TAAAAGCTTG GCTGTTAAAA GGAGGAGAGA AACAAGACTA GCTGCAAAGT GAGATTGGGT 3 510 

TGATGGAGCA GTTTTAAATC TCAAAATAAA GAGCTTTGTG CTTTTTTGAT TATGAAAATA 3 570 
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ATGTGTTAAT TGTAACTAAT TGAGGCAATG AAAAAAGATA ATAATATGAA AGATAAAAAT 3 630 

ATAAAAACCA CCCAGAAATA ATGATAGCTA CCATTTTGAT ACAATATTTC TACACTCCTT 3690 

TCTATGTATA TATACAGACA CAGAAATGCT TATATTTTTA TTAAAAGGGA TTGTACTATA 37 50 

CCTAAGCTGC TTTTTCTAGT TAGTGATATA TATGGACATC TCTCCATGGC AACGAGTAAT 3810 

TGCAGTT^TA TTAAGTTCAT GATATTTCAC AATAAGGGCA TATCTTTGCC CTTTTTATTT 3870 

AATCAATTCT TAATTGGTGA ATGTTTGTTT CCAGTTTGTT GTTGTTATTA ACAATGTTCC 3930 

CATAAGCATT CCTGTACACC AATGTTCACA CATTTGTCTG ATTTTTTCTT CAGGATAAAA 3990 

CCCAGGAGGT AGAATTGCTG GGTTGATAGA AGAGAAAGGA TGATTGCCAA ATTAAAGCTT 4050 

CAGTAGAGGG TACATGCCGA GCACAAATGG GATCAGCCCT AGATACCAGA AATGGCACTT 4110 

TCTCATTTCC CCTTGGGACA AAAGGGAGAG AGGCAATAAC TGTGCTGCCA GAGTTAAATT 4170 

TGTACGTGGA GTAGCAGGAA ATCATTTGCT GAAAATGAAA ACAGAGATGA TGTTGTAGAG 4230 

GTCCTGAAGA GAGCAAAGAA AATTTGAAAT TGCGGCTATC AGCTATGGAA GAGAGTGCTG 4290 

AACTGGAAAA CAAAAGAAGT ATTGACAATT GGTATGCTTG TAATGGCACC GATTTGAACG 4350 

CTTGTGCCAT TGTTCACCAG CAGCACTCAG CAGCCAAGTT TGGAGTTTTG TAGCAGAAAG 4410 

ACAAATAAGT TAGGGATTTA ATATCCTGGC CAAATGGTAG ACAAAATGAA CTCTGAGATC 4470 

CAGCTGCACA GGGAAGGAAG GGAAGACGGG AAGAGGTTAG ATAGGAAATA CAAGAGTCAG 4530 

GAGACTGGAA GATGTTGTGA TATTTAAGAA CACATAGAGT TGGAGTAAAA GTGTAAGAAA 4 5 90 

ACTAGAAGGG TAAGAGACCG GTCAGAAAGT AGGCTATTTG AAGTTAACAC TTCAGAGGCA 4650 

GAGTAGTTCT GAATGGTAAC AAGAAATTGA GTGTGCCTTT GAGAGTAGGT TAAAAAACAA 4710 

TAGGCAACTT TATTGTAGCT ACTTCTGGAA CAGAAGATTG TCATTAATAG TTTTAGAAAA 4770 

CTAAAATATA TAGCATACTT ATTTGTCAAT TAACAAAGAA ACTATGTATT TTTAAATGAG 4830 

ATTTAATGTT TATTGTAG AA AAC CTG GAA TCA GAT TAC TTT GGC AAG CTT 4880 

Glu Asn Leu Glu Ser Asp Tyr Phe Gly Lys Leu 
-5 15 

GAA TCT AAA TTA TCA GTC ATA AGA AAT TTG A AT GAC CAA GTT CTC TTC 4928 
Glu Ser Lys Leu Ser Val lie Arg Asn Leu Asn Asp Gin Val Leu Phe 

10 15 20 

ATT GAC CAA GGA AAT CGG CCT CTA TTT GAA GAT ATG ACT GAT TCT GAC 4976 
lie Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp Met Thr Asp Ser Asp 

25 30 35 

TGT AGA G GTATTTTTT TTAATTCGCA AACATAGAAA TGACTAGCTA CTTCTTCCCA 5032 
Cys Arg Asp 
40 

TTCTGTTTTA CTGCTTACAT TGTTCCGTGC TAGTCCCAAT CCTCAGATGA AAAGTCACAG 509 2 

GAGTGACAAT AATTTCACTT ACAGGAAACT TTATAAGGCA TCCACGTTTT TTAGTTGGGG 5152 

TAAAAAATTG GATACAATAA GACATTGCTA GGGGTCATGC CTCTCTGAGC CTGCCTTTGA 5212 

ATCACCAATC CCTTTATTGT GATTGCATTA ACTGTTTAAA ACCTCTATAG TTGGATGCTT 5272 

AATCCCTGCT TGTTACAGCT GAAAATGCTG ATAGTTTACC AGGTGTGGTG GCATCTATCT 5332 

GTAATCCTAG CTACTTGGGA GGCTCAAGCA GGAGGATTGC TTGAGGCCAG GACTTTGAGG 5392 

CTGTAGTACA CTGTGATCGT ACCTGTGAAT AGCCACTGCA CTCCAGCCTG GGTGATATAC 5452 

AGACCTTGTC TCTAAAATTA AAAAAAAAAA AAAAAAAAAC CTTAGGAAAG GAAATTGATC 5 512 

AAGTCTACTG TGCCTTCCAA AACATGAATT CCAAATATCA AAGTTAGGCT GAGTTGAAGC 5 572 

AGTGAATGTG CATTCTTTAA AAATACTGAA TACTTACCTT AACATATATT TTAAATATTT 5 632 

TATTTAGCAT TTAAAAGTTA AAAACAATCT TTTAGAATTC ATATCTTTAA AATACTCAAA 5692 

AAAGTTGCAG CGTGTGTGTT GTAATACACA TTAAACTGTG GGGTTGTTTG TTTGTTTGAG 5752 

ATGCAGTTTC ACTCTGTCAC CCAGGCTGAA GTGCAGTGCA GTGCAGTGGT GTGATCTCGG 5812 

CTCACTACAA CCTCCACCTC CCACGTTCAA GCGATTCTCA TGCCTCAGTC TCCCGAGTAG 5872 

GTGGGATTAC AGGCATGCAC CACTTACACC CGGCTAATTT TTGTATTTTT AGTAGAGCTG 5932 

GGGTTTCACC ATGTTGGCCA GGCTGGTCTC AAACCCCTAA CCTCAAGTGA TCTGCCTGCC 5992 

TCAGCCTCCC AAACAAACAA ACAACCCCAC AGTTTAATAT GTGTTACAAC ACACATGCTG 6052 

CAACTTTTAT GAGTATTTTA ATGATATAGA TTATAAAAGG TTGTTTTTAA CTTTTAAATG 6112 

CTGGGATTAC AGGCATGAGC CACTGTGCCA GGCCTGAACT GTGTTTTTAA AAATGTCTGA 6172 

CCAGCTGTAC ATAGTCTCCT GCAGACTGGC CAAGTCTCAA AGTGGGAACA GGTGTATTAA 6232 

GGACTATCCT TTGGTTAAAT TTCCGCAAAT GTTCCTGTGC AAGAATTCTT CTAACTAGAG 629 2 

TTCTCATTTA TTATATTTAT TTCAG AT AAT GCA CCC CGG ACC ATA TTT ATT 6343 

Asp Asn Ala Pro Arg Thr lie Phe lie 
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40 45 

ATA AGT ATG TAT AAA GAT AGC CAG CCT AGA GGT ATG GCT GTA ACT ATC 63 91 
lie Ser Met Tyr Lys Asp Ser Gin Pro Arg Gly Met Ala Val Thr lie 

50 55 60 

TCT GTG AAG TGT GAG AAA ATT TCA ACT CTC TCC TGT GAG AAC AAA ATT 6439 
Ser Val Lys Cys Glu Lys lie Ser Thr Leu Ser Cys Glu Asn Lys lie 
65 70 75 80 

ATT TCC TTT AAG GTAAG ACTGAGCCTT ACTTTGTTTT CAATCATGTT AATATAATCA 649 6 
lie Ser Phe Lys 

ATATAATTAG AAATATAACA TTATTTCTAA TGTTAATATA AGTAATGTAA TTAGAAAACT 6556 

CAAATATCCT CAGACCAACC TTTTGTCTAG AACAGAAATA ACAAGAAGCA GAGAACCATT 6616 

AAAGTGAATA CTTACTAAAA ATTATCAAAC TCTTTACCTA TTGTGATAAT GATGGTTTTT 6 67 6 

CTGAGCCTGT CACAGGGGAA GAGGAGATAC AACACTTGTT TTATGACCTG CATCTCCTGA 673 6 

ACAATCAGTC TTTATACAAA TAATAATGTA GAATACATAT GTGAGTTATA CATTTAAGAA 679 6 

TAACATGTGA CTTTCCAGAA TGAGTTCTGC TATGAAGAAT GAAGCTAATT ATCCTTCTAT 68 5 6 

ATTTCTACAC CTTTGTAAAT TATGATAATA TTTTAATCCC TAGTTGTTTT GTTGCTGATC 6916 

CTTAGCCTAA GTCTTAGACA CAAGCTTCAG CTTCCAGTTG ATGTATGTTA TTTTTAATGT 697 6 

TAATCTAATT GAATAAAAGT TATGAGATCA GCTGTAAAAG TAATGCTATA ATTATCTTCA 703 6 

AGCCAGGTAT AAAGTATTTC TGGCCTCTAC TTTTTCTCTA TTATTCTCCA TTATTATTCT 709 6 

CTATTATTTT TCTCTATTTC CTCCATTATT GTTAGATAAA CCACAATTAA CTATAGCTAC 715 6 

AGACTGAGCC AGTAAGAGTA GCCAGGGATG CTTACAAATT GGCAATGCTT CAGAGGAGAA 7216 

TTCCATGTCA TGAAGACTCT TTTTGAGTGG AGATTTGCCA ATAAATATCC GCTTTCATGC 7276 

CCACCCAGTC CCCACTGAAA GACAGTTAGG ATATGACCTT AGTGAAGGTA CCAAGGGGCA 733 6 

ACTTGGTAGG GAGAAAAAAG CCACTCTAAA ATATAATCCA AGTAAGAACA GTGCATATGC 7396 

AACAGATACA GCCCCCAGAC AAATCCCTCA GCTATCTCCC TCCAACCAGA GTGCCACCCC 7456 

TTCAGGTGAC AATTTGGAGT CCCCATTCTA GACCTGACAG GCAGCTTAGT TATCAAAATA 7516 

GCATAAGAGG CCTGGGATGG AAGGGTAGGG TGGAAAGGGT TAAGCATGCT GTTACTGAAC 7 57 6 

AACATAATTA GAAGGGAAGG AGATGGCCAA GCTCAAGCTA TGTGGGATAG AGGAAAACTC 7 63 6 

AGCTGCAGAG GCAGATTCAG AAACTGGGAT AAGTCCGAAC CTACAGGTGG ATTCTTGTTG 7 69 6 

AGGGAGACTG GTGAAAATGT TAAGAAGATG GAAATAATGC TTGGCACTTA GTAGGAACTG 77 5 6 

GGCAAATCCA TATTTGGGGG AGCCTGAAGT TTATTCAATT TTGATGGCCC TTTTAAATAA 7816 

AAAGAATGTG GCTGGGCGTG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGGCCGAG 7876 

GGGGGCGGAT CACCTGAAGT CAGGAGTTCA AGACCAGCCT GACCAACATG GAGAAACCCC 7936 

ATCTCTACTA A A A A T AC A A A ATTAGCTGGG CGTGGTGGCA TATGCCTGTA ATCCCAGCTA 799 6 

CTCGGGAGGC TGAGGCAGGA GAATCTTTTG AACCCGGGAG GCAGAGGTTG CGATGAGCCT 805 6 

AGATCGTGCC ATTGCACTCC AGCCTGGGCA ACAAGAGCAA AACTCGGTCT CAAAAAAAAA 8116 

AAAAAAAAAG TGAAATTAAC CAAAGGCATT AGCTTAATAA TTTAATACTG TTTTTAAGTA 817 6 

GGGCGGGGGG TGGCTGGAAG AGATCTGTGT AAATGAGGGA ATCTGACATT TAAGCTTCAT 823 6 

CAGCATCATA GCAAATCTGC TTCTGGAAGG AACTCAATAA ATATTAGTTG GAGGGGGGGA 829 6 

GAGAGTGAGG GGTGGACTAG GACCAGTTTT AGCCCTTGTC TTTAATCCCT TTTCCTGCCA 8356 

CTAATAAGGA TCTTAGCAGT GGTTATAAAA GTGGCCTAGG TTCTAGATAA TAAGATACAA 8416 

CAGGCCAGGC ACAGTGGCTC ATGCCTATAA TCCCAGCACT TTGGGAGGGC AAGGCGAGTG 847 6 

TCTCACTTGA GATCAGGAGT TCAAGACCAG CCTGGCCAGC ATGGCGATAC TCTGTCTCTA 8 53 6 

CTAAAAAAAA TACAAAAATT AGCCAGGCAT GGTGGCATGC ACCTGTAATC CCAGCTACTC 859 6 

GTGAGCCTGA GGCAGAAGAA TCGCTTGAAA CCAGGAGGTG TAGGCTGCAG TGAGCTGAGA 8 6 56 

TCGCACCACT GCACTCCAGC CTGGGCGACA GAATGAGACT TTGTCTCAAA AAAAGAAAAA 8716 

GATACAACAG GCTACCCTTA TGTGCTCACC TTTCACTGTT GATTACTAGC TATAAAGTCC 877 6 

TATAAAGTTC TTTGGTCAAG AACCTTGACA ACACTAAGAG GGATTTGCTT TGAGAGGTTA 8 83 6 

CTGTCAGAGT CTGTTTCATA TATATACATA TACATGTATA TATGTATCTA TATCCAGGCT 8 89 6 

TGGCCAGGGT TCCCTCAGAC TTTCCAGTGC ACTTGGGAGA TGTTAGGTCA ATATCAACTT 895 6 

TCCCTGGATT CAGATTCAAC CCCTTCTGAT GTAAAAAAAA AAAAAAAAAA GAAAGAAATC 9016 

CCTTTCCCCT TGGAGCACTC AAGTTTCACC AGGTGGGGCT TTCCAAGTTG GGGGTTCTCC 907 6 

AAGGTCATTG GGATTGCTTT CACATCCATT TGCTATGTAC CTTCCCTATG ATGGCTGGGA 913 6 

GTGGTCAACA TCAAAACTAG GAAAGCTACT GCCCAAGGAT GTCCTTACCT CTATTCTGAA 919 6 

ATGTGCAATA AGTGTGATTA AAGAGATTGC CTGTTCTACC TATCCACACT CTCGCTTTCA 9256 

ACTGTAACTT TCTTTTTTTC TTTTTTTCTT TTTTTCTTTT TTTTTGAAAC GGAGTCTCGC 9316 
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TCTGTCGCCC AGGCTAGAGT GCAGTGGCAC GATCTCAGCT CACTGCAAGC TCTGCCTCCC 
GGGTTCACGC CATTCTCCTG CCTCACCCTC CCAAGCAGCT GGGACTACAG GCGCCTGCCA 
CCATGCCCAG CTAATTTTTT GTATTTTTAG TAGAGACGGG GTTTCACCGT GTTAGCCAGG 
ATGGTCTCGA TCTCCTGAAC TTGTGATCCG CCCGCCTCAG CCTCCCAAAG TGCTGGGATT 
ACAGGCGTGA GCCATCGCAC CCGGCTCAAC TGTAACTTTC TATACTGGTT CATCTTCCCC 
TGTAATGTTA CTAGAGCTTT TGAAGTTTTG GCTATGGATT ATTTCTCATT TATACATTAG 
ATTTCAGATT AGTTCCAAAT TGATGCCCAC AGCTTAGGGT CTCTTCCTAA ATTGTATATT 
GTAGACAGCT GCAGAAGTGG GTGCCAATAG GGGAACTAGT TTATACTTTC ATCAACTTAG 
GACCCACACT TGTTGATAAA GAACAAAGGT CAAGAGTTAT GACTACTGAT TCCACAACTG 
ATTGAGAAGT TGGAGATAAC CCCGTGACCT CTGCCATCCA GAGTCTTTCA GGCATCTTTG 
AAGGATGAAG AAATGCTATT TTAATTTTGG AGGTTTCTCT ATCAGTGCTT AGGATCATGG 
GAATCTGTGC TGCCATGAGG CCAAAATTAA GTCCAAAACA TCTACTGGTT CCAGGATTAA 
CATGGAAGAA CCTTAGGTGG TGCCCACATG TTCTGATCCA TCCTGCAAAA TAGACATGCT 
GCACTAACAG GAAAAGTGCA GGCAGCACTA CCAGTTGGAT AACCTGCAAG ATTATAGTTT 
CAAGTAATCT AACCATTTCT CACAAGGCCC TATTCTGTGA CTGAAACATA CAAGAATCTG 
CATTTGGCCT TCTAAGGCAG GGCCCAGCCA AGGAGACCAT ATTCAGGACA GAAATTCAAG 
ACTACTATGG AACTGGAGTG CTTGGCAGGG AAGACAGAGT CAAGGACTGC CAACTGAGCC 
AATACAGCAG GCTTACACAG GAACCCAGGG CCTAGCCCTA CAACAATTAT TGGGTCTATT 
CACTGTAAGT TTTAATTTCA GGCTCCACTG AAAGAGTAAG CTAAGATTCC TGGCACTTTC 
TGTCTCTCTC ACAGTTGGCT CAGAAATGAG AACTGGTCAG GCCAGGCATG GTGGCTTACA 
CCTGGAATCC CAGCACTTTG GGAGGCCGAA GTGGGAGGGT CACTTGAGGC CAGGAGTTCA 
GGACCAGCTT AGGCAACAAA GTGAGATACC CCCTGACCCC TTCTCTACAA AAATAAATTT 
TAAAAATTAG CCAAATGTGG TGGTGTATAC TTACAGTCCC AGCTACTCAG GAGGCTGAGG 
CAGGGGGATT GCTTGAGCCC AGGAAT T C A A GGCTGCAGTG AGCTATGATT TCACCACTGC 
ACTTCTGGCT GGGCAACAGA GCGAGACCCT GTCTCAAAGC AAAAAGAAAA AGAAACTAGA 
ACTAGCCTAA GTTTGTGGGA GGAGGTCATC ATCGTCTTTA GCCGTGAATG GTTATTATAG 
AGGACAGAAA TTGACATTAG CCCAAAAAGC TTGTGGTCTT TGCTGGAACT CTACTTAATC 
TTGAGCAAAT GTGGACACCA CTCAATGGGA GAGGAGAGAA GTAAGCTGTT TGATGTATAG 
GGGAAAACTA GAGGCCTGGA ACTGAATATG CATCCCATGA CAGGGAGAAT AGGAGATTCG 
GAGTTAAGAA GGAGAGGAGG TCAGTACTGC TGTTCAGAGA TTTTTTTTAT GTAACTCTTG 
AGAAGCAAAA CTACTTTTGT TCTGTTTGGT AATATACTTC AAAACAAACT TCATATATTC 
AAATTGTTCA TGTCCTGAAA TAATTAGGTA ATGTTTTTTT CTCTATAG GAA ATG A AT 

Glu Met Asn 
85 

CCT CCT GAT AAC ATC A AG GAT ACA AAA AGT GAC ATC ATA TTC TTT CAG 
Pro Pro Asp Asn lie Lys Asp Thr Lys Ser Asp lie lie Phe Phe Glu 

90 95 100 

AGA AGT GTC CCA GGA CAT GAT AAT AAG ATG CAA TTT GAA TCT TCA TCA 
Arg Ser Val Pro Gly His Asp Asn Lys Met Gin Phe Glu Ser Ser Ser 

105 110 115 

TAC GAA GGA TAC TTT CTA GCT TGT GAA AAA GAG AGA GAC CTT TTT AAA 
Tyr Glu Gly Tyr Phe Leu Ala Cys Glu Lys Glu Arg Asp Leu Phe Lys 
120 125 130 135 

CTC ATT TTG AAA AAA GAG GAT GAA TTG GGG GAT AGA TCT ATA ATG TTC 
Leu lie Leu Lys Lys Glu Asp Glu Leu Gly Asp Arg Ser lie Met Phe 

140 145 150 

ACT GTT CAA AAC GAA GAC TAGCTATTAA AATTTCATGC C 
Thr Val Gin Asn Glu Asp 
155 



9376 
9436 
9496 
9556 
9616 
9676 
9736 
9796 
9856 
9916 
9976 
10036 
10096 
10156 
10216 
10276 
10336 
10396 
10456 
10516 
10576 
10636 
10696 
10756 
10816 
10876 
10936 
10996 
11056 
11116 
11176 
11233 



11281 



11329 



11377 



11425 



11464 



(18) INFORMATION FOR SEQ ID NO: 18: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 471 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 
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( D ) TOPOLOGY : linear 

( ii )MOLECULE TYPE: cDNA to mRNA 

(vi) ORIGINAL SOURCE: 

( A ) ORGANISM : mouse 
( G ) CELL TYPE: liver 

( ix) FEATURE: 

( A ) NAME / KEY : mat peptide 

(B) LOCATION: 1. .471 

(C) IDENTIFICATION METHOD: S 



( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



AAC 


TTT 


GGC 


CGA 


CTT 


CAC 


TGT 


ACA 


ACC 


GCA 


GTA 


ATA 


CGG 


AAT 


ATA 


AAT 


48 


Asn 


Phe 


Glv 


Aro 


Leu 


His 


Cys 


Thr 


Thr 


Ala 


Val 


lie 


Arq 


Asn 


lie 


Asn 




1 






5 










10 










15 






GAC 


CAA 


GTT 


CTC 


TTC 


GTT 


GAC 


AAA 


AGA 


CAG 


CCT 


GTG 


TTC 


GAG 


GAT 


ATG 


96 


Asp 


Gin 


Val 


Leu 


Phe 


Val 


Asp 


Lys 


Arg 


Gin 


Pro 


Val 


Phe 


Glu 


Asp 


Met 








20 










25 










30 








ACT 


GAT 


ATT 


GAT 


CAA 


AGT 


GCC 


AGT 


GAA 


CCC 


CAG 


ACC 


AGA 


CTG 


ATA 


ATA 


144 


Thr 


Asp 


lie 


Asp 


Gin 


Ser 


Ala 


Ser 


Glu 


Pro 


Gin 


Thr 


Arq 


Leu 


lie 


lie 






35 










40 










45 










TAC 


ATG 


TAC 


AAA 


GAC 


AGT 


GAA 


GTA 


AGA 


GGA 


CTG 


GCT 


GTG 


ACC 


CTC 


TCT 


192 


Tyr 


Met 
50 


Tyr 


Lys 


Asp 


Ser 


Glu 
55 


Val 


Arg 


Gly 


Leu 


Ala 
60 


Val 


Thr 


Leu 


Ser 




GTG 


AAG 


GAT 


AGT 


AAA 


ATG 


TCT 


ACC 


CTC 


TCC 


TGT 


AAG 


AAC 


AAG 


ATC 


ATT 


240 


Val 


Lys 


Asp 


Ser 


Lys 


Met 


Ser 


Thr 


Leu 


Ser 


Cys 


Lys 


Asn 


Lys 


lie 


lie 




65 










70 










75 










80 




TCC 


TTT 


GAG 


GAA 


ATG 


GAT 


CCA 


CCT 


GAA 


AAT 


ATT 


GAT 


GAT 


ATA 


CAA 


AGT 


288 


Ser 


Phe 


Glu 


Glu 


Met 


Asp 


Pro 


Pro 


Glu 


Asn 


He 


Asp 


Asp 


He 


Gin 


Ser 












85 








90 










95 






GAT 


CTC 


ATA 


TTC 


TTT 


CAG 


AAA 


CGT 


GTT 


CCA 


GGA 


CAC 


AAC 


AAG 


ATG 


GAG 


336 


Asp 


Leu 


lie 


Phe 


Phe 


Gin 


Lys 


Arg 


Val 


Pro 


Gly 


His 


Asn 


Lys 


Met 


Glu 








100 










105 










110 








TTT 


GAA 


TCT 


TCA 


CTG 


TAT 


GAA 


GGA 


CAC 


TTT 


CTT 


GCT 


TGC 


CAA 


AAG 


GAA 


384 


Phe 


Glu 


Ser 


Ser 


Leu 


Tyr 


Glu 


Gly 


His 


Phe 


Leu 


Ala 


Cys 


Gin 


Lys 


Glu 








115 








120 










125 










GAT 


GAT 


GCT 


TTC 


AAA 


CTC 


ATT 


CTG 


AAA 


AAA 


AAG 


GAT 


GAA 


AAT 


GGG 


GAT 


432 


Asp 


Asp 
130 


Ala 


Phe 


Lys 


Leu 


He 
135 


Leu 


Lys 


Lys 


Lys 


Asp 
140 


Glu 


Asn 


Gly 


Asp 




AAA 


TCT 


GTA 


ATG 


TTC 


ACT 


CTC 


ACT 


AAC 


TTA 


CAT 


CAA 


AGT 








471 


Lys 


Ser 


Val 


Met 


Phe 


Thr 


Leu 


Thr 


Asn 


Leu 


His 


Gin 


Ser 










145 










150 










155 















(19) INFORMATION FOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH : 9 amino acids 

( B ) TYPE : amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: N- terminal fragment 
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0 # 



( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Asn Phe Gly Arg Leu His Cys Thr Thr 

1 5 

(20) INFORMATION FOR SEQ ID NO: 20: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Tyr 


Phe 


Gly Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


He 


Arg 


Asn 


Leu 


Asn 


1 






5 










10 










15 




Asp 


Gin 


Val Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 




Pro 


Leu 


Phe 


Glu 


Asp 






20 










25 










30 






Met 


Thr 


Asp Ser 


Asp 


Cys 


Arg 


Asp 


Asn 


Ala 


Pro 


Arg 


Thr 


He 


Phe 


He 






35 








40 










45 








lie 


Ser 


Met Tyr 


Lys 


Asp 


Ser 


Gin 


Pro 


Arg 


Gly 


Met 


Ala 


Val 


Thr 


He 




50 








55 










60 










Ser 


Val 


Lys Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Cys 


Glu 


Asn 


Lys 


He 


65 








70 










75 










80 


lie 


Ser 


Phe Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


noi i 


Tip 


J— i_y o 


A c n 

/AO fc> 


Thr 










85 










90 
















Asp 


He lie 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


nu 


CI 1 X7 


n j_ o 




A en 


T.T7C 

JL 






100 










105 










1 1 O 

J_ X \J 








Gin 


Phe Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


iyr 


Xr 1 lti 


Lit; U 


Al = 
nl d 










115 








120 










Jl ^ -D 








Lys 


Glu 


Arg Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


T ,\7Q 

J— i_y o 


i~t y o 




A en 


OJ.U 


T ,D1 1 
Li. 




130 








135 










X ft VJ 










Gly 


Asp 


Arg Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 








145 








150 










155 












(21) 


INFORMATION FOR SEQ ID 


NO: 


21: 


















(i) SEQUENCE CHARACTERISTICS: 




















(A) LENGTH: 157 amino 


acids 


















(B) TYPE: 


amino acid 






















(D) TOPOLOGY: 


linear 




















(ii) MOLECULE 


TYPE: peptide 


















(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO: 


21: 










Tyr 


Phe 


Gly Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


He 


Arg 


Asn 


Leu 


Asn 


1 






5 










10 










15 




Asp 


Gin 


Val Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 


Arg 


Pro 


Leu 


Phe 


Glu 


Asp 






20 










25 










30 






Met 


Thr 


Asp Ser 


Asp 


Ser 


Arg 


Asp 


Asn 


Ala 


Pro 


Arg 


Thr 


He 


Phe 


He 






35 








40 










45 








He 


Ser 


Met Tyr 


Lys 


Asp 


Ser 


Gin 


Pro 


Arg 


Gly 


Met 


Ala 


Val 


Thr 


He 




50 








55 










60 
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V61 


Lys 


O -r- 


oil* 


Lys 


x xe 


C t- 


i nr 


Leu 


O r~\ -r~- 


uys 


pin 


Asn 


Lys lie 


D 3 










/ U 




















Tip 


Or^ -r- 


Php 


T T7C 
-*— CD 


Vj7 _I_ Li 




Aon 




Pro 


Asp 


Asn 


He 


Lys 


Asp 


Thr Lys 










o o 










on 












O J- 


A ^ n 
^ ^ 


Tip 

-L -L " 


Tip* 

-L _l_ tz; 


Phe 


Phe 


Gin 


Arn 


Ser 


Val 


Pro 


Glv 


His 


Asp 










i nn 










1 DR 
J. w -J 










110 
j — 




Met 


Gin 


Phe 


Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Phe 


Leu 


Ala 


Cys Glu 






115 










120 










125 






Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu Leu 




130 










135 










140 








Gly 


Asp 


Arg 


Ser 


lie 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 






145 










150 










155 











(22) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Tyr 


Phe 


Gly 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


He 


Arg 


Asn 


Leu 


Asn 


1 








5 










10 










15 




Asp 


Gin 


Val 


Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 


Arg 


Pro 


Leu 


Phe 


Glu 


Asp 








20 










25 










30 






Met 


Thr 


Asp 


Ser 


Asp 


Cys 


Arg 


Asp 


Asn 


Ala 


Pro 


Arg 


Thr 


He 


Phe 


He 






35 










40 










45 








He 


Ser 


Met 


Tyr 


Lys 


Asp 


Ser 


Gin 


Pro 


Arg 


Gly 


Met 


Ala 


Val 


Thr 


He 




50 










55 










60 










Ser 


Val 


Lys 


Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Cys 


Glu 


Asn 




He 


65 










70 










75 










80 


He 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


Asn 


He 


Lys 


Asp 


Thr 


Lys 










85 










90 










95 




Ser 


Asp 


He 


He 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


Asp 


Asn 


Lys 








100 










105 










110 






Met 


Gin 


Phe 


Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Phe 


Leu 


Ala 


Ser 


Glu 






115 










120 










125 








Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 




130 










135 










140 










Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 








145 










150 










155 












(23) 


INFORMATION FOR SEQ ID 


NO: 


23: 

















( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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Tyr 


Phe 


Gly 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


lie 


Arg 


Asn 


Leu 


Asn 


1 








5 










10 










15 




Asp 


Gin 


Val 


Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 


Ara 


Pro 


Leu 


Phe 


Glu 


Asp 








20 










25 










30 






Met 


Thr 


Asp 


Ser 


Asp 


Ser 


Ara 


Asd 


Asn 


Ala 


Pro 


Ara 


Thr 


He 


Phe 


lie 






35 










40 










45 








lie 


Ser 


Met 


Tvr 


Lys 


Asp 


Ser 


Gin 


Pro 


Ara 


Glv 

v_/ -i- jr 


Met 


Ala 


Val 


Thr 


lie 




50 










55 










60 










Ser 


Val 


Lys 


Ser 


Glu 


Lys 


lie 


Ser 


Thr 


Leu 


Ser 


Cys 


Glu 


Asn 


Lys 


lie 


65 










70 










75 










80 


lie 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


Asn 


lie 


Lys 


Asp 


Thr 


Lvs 










85 










90 










95 




Ser 


Asp 


lie 


lie 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


ASD 


Asn 


Lys 








100 










105 










110 






Met 


Gin 


Phe 


Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Phe 


Leu 


Ala 


Ser 


Glu 






115 










120 










125 








Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


lie 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 




130 










135 










140 










Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 








145 










150 










155 













(24) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Tyr 


Phe 


Gly 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


He 


Arg 


Asn 


Leu 


Asn 


1 








5 










10 










15 




Asp 


Gin 


Val 


Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 


Arg 


Pro 


Leu 


Phe 


Glu 


Asp 








20 










25 










30 






Met 


Thr 


Asp 


Ser 


Asp 


Ser 


Arg 


Asp 


Asn 


Ala 


Pro 


Arg 


Thr 


He 


Phe 


He 






35 










40 










45 








He 


Ser 


Met 


Tyr 


Lys 


Asp 


Ser 


Gin 


Pro 


Arg 


Gly 


Met 


Ala 


Val 


Thr 


He 




50 










55 










60 










Ser 


Val 


Lys 


Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Ser 


Glu 


Asn 


Lys 


He 


65 










70 










75 










80 


He 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


Asn 


He 


Lys 


Asp 


Thr 


Lys 










85 










90 










95 




Ser 


Asp 


He 


He 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


Asp 


Asn 


Lys 








100 










105 










110 






Met 


Gin 


Phe 


Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Phe 


Leu 


Ala 


Ser 


Glu 






115 










120 










125 








Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 




130 










135 










140 










Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 









145 150 155 

(25) INFORMATION FOR SEQ ID NO: 25: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 157 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Tyr 


Phe 


Gly 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


v ax 


Tic* 

lie 


Arg 


noil 




A on 

/A Oil 


1 




5 










i n 
x u 










15 




Asp 


Gin 


Val 


Leu 


Phe 


He 


Asp 


Gin 


Cj±y 


Asn 


Arg 


iri U 


Leu 


i i it; 


V_7 _L \JL 


Asp 






ZU 




















30 






Met 


Thr 


Asp 


Ser 


Asp 


ber 


Arg 


Asp 


Asn 


M.J- d 


Jr -L LJ 




X i 1JL 


He 


Phe 


He 






35 










4t u 


















lie 


Ser 


Met 


Tyr 


Lys 


Asp 


Ser 


Gin 


Pro 


Arg 


Gly 


Met 


Ala 


Val 


Thr 


He 




50 








55 










60 










Ser 


Val 


Lys 


Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Ala 


Glu 


Asn 


Lys 


He 


65 








70 










75 










80 


lie 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


/A CD p 


r\ o 1 1 


Tip 


Lys 


Asp 


Thr 


Lys 








85 










90 










Q S 




Ser 


Asp 


He 


He 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


"D -r~ r~\ 




ii _L O 


B en 

/A O fcJ 


r\ ^ 1 1 


Lys 






100 










105 










110 






Met 


Gin 


Phe 


Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Phe 


Leu 


Ala 


Cys 


Glu 






115 










120 










125 








Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 


130 








135 










140 










Gly 


Asp 


Arg 


Ser 


He 


Met 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Asp 








145 






150 










i ^ ^ 
1 3 j 












(26) 


INFORMATION FOR SEQ ID 


NO: 


26: 


















(i) SEQUENCE CHARACTERISTICS: 




















(A) LENGTH: 157 amino 


acids 


















(B) TYPE: 


amino acid 






















(D) TOPOLOGY: 


linear 




















(ii) MOLECULE 


TYPE: peptide 


















(xi) SEQUENCE 


DESCRIPTION 


• SEQ ID 


NO : 


ZD : 










Tyr 


Phe 


Gly 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Ser 


Val 


x xe 


Arg 


Asn 


Leu 


Asn 


1 








5 










10 










i ^ 

X D 




Asp 


Gin 


Val 


Leu 


Phe 


He 


Asp 


Gin 


Gly 


Asn 


Arg 


Pro 


Leu 


TD q 


pi n 
(jlli 


Asp 






20 










25 










30 






Met 


Thr 


Asp 


Ser 


Asp 


Ser 


Arg 


Asp 


Asn 


Ala 


Pro 


Arg 


Thr 


He 


Phe 


He 






35 










40 










45 








lie 


Ser 


Met 


Tyr 


Lys 


Asp 


Ser 


Gin 


Pro 


Arg 


Gly 


Met 


Ala 


Val 


Thr 


He 




50 






55 










60 










Ser 


Val 


Lys 


Ser 


Glu 


Lys 


He 


Ser 


Thr 


Leu 


Ser 


Ala 


Glu 


Asn 


Lys 


He 


65 








70 










75 










80 


lie 


Ser 


Phe 


Lys 


Glu 


Met 


Asn 


Pro 


Pro 


Asp 


Asn 


He 


Lys 


Asp 


Thr 


Lys 








85 










90 










95 




Ser 


Asp 


He 


lie 


Phe 


Phe 


Gin 


Arg 


Ser 


Val 


Pro 


Gly 


His 


Asp 


Asn 


Lys 






100 










105 










110 






Met 


Gin 


Phe 


Glu 


Ser 


Ser 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Phe 


Leu 


Ala 


Ser 


Glu 






115 










120 










125 








Lys 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Leu 


He 


Leu 


Lys 


Lys 


Glu 


Asp 


Glu 


Leu 
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130 135 140 

Gly Asp Arg Ser lie Met Phe Thr Val Gin Asn Glu Asp 
145 150 155 

(27) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 





(ii) MOLECULE 


TYPE 


;: peptide 


















( xi ) SEQUENCE 


DESCRIPTION: 


SEQ ID 


InU * 


z / . 










Asn 


Phe 


Gly 


Arg 


Leu 


His 


Ala 


Thr 


Thr 


Ala 


vai 


lie 


Arg 


Asn 


lie 


Asn 


1 




5 










10 










1 D 




Asp 


Gin 


Val 


Leu 


Phe 


Val 


Asp 


Lys 


Arg 


Gin 


Pro 


Val 


Phe 


Glu 


Asp 


Met 






20 










25 










30 






rp V-v -r~ 

1 nr 


Asp 


He 


Asp 


Gin 


Ser 


Ala 


Ser 


Glu 


Pro 




J. 1 1 JL 


A TTT 


Leu 


lie 


He 




35 










40 










45 








Tyr 


Met 


Tyr 


Lys 


Asp 


Ser 


Glu 


Val 


Arg 


Gly 


Leu 


Ala 


Val 


Thr 


Leu 


Ser 


50 










55 










60 










Val 


Lys 


Asp 


Ser 


Lys 


Met 


Ser 


Thr 


Leu 


Ser 


Cys 


Lys 


Asn 


Lys 


He 


He 


65 






70 










75 










80 


Ser 


Phe 


Glu 


Glu 


Met 


Asp 


Pro 


Pro 


Glu 


Asn 


lie 


Asp 


Asp 


lie 


Gin 


Ser 










85 










90 










95 




Asp 


Leu 


He 


Phe 


Phe 


Gin 


Lys 


Arg 


Val 


Pro 


Gly 


His 


Asn 


Lys 


Met 


Glu 






100 










105 










110 






Phe 


Glu 


Ser 


Ser 


Leu 


Tyr 


Glu 


Gly 


His 


Phe 


Leu 


Ala 


Cys 


Gin 


Lys 


Glu 






115 










120 










125 








Asp 


Asp 


Ala 


Phe 


Lys 


Leu 


lie 


Leu 


Lys 


Lys 


Lys 


Asp 


Glu 


Asn 


Gly 


Asp 


130 










135 










140 










Lys 


Ser 


Val 


Met 


Phe 


Thr 


Leu 


Thr 


Asn 


Leu 


His 


Gin 


Ser 








145 










150 










155 












(28) 


INFORMATION FOR SEQ ID 


NO: 


28: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 





(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO: 


28: 










Asn 


Phe 


Gly 


Arg 


Leu 


His 


Cys 


Thr 


Thr 


Ala 


Val 


He 


Arg 


Asn 


He 


Asn 


1 








5 










10 










15 




Asp 


Gin 


Val 


Leu 


Phe 


Val 


Asp 


Lys 


Arg 


Gin 


Pro 


Val 


Phe 


Glu 


Asp 


Met 








20 










25 










30 






Thr 


Asp 


He 


Asp 


Gin 


Ser 


Ala 


Ser 


Glu 


Pro 


Gin 


Thr 


Arg 


Leu 


He 


He 






35 










40 










45 








Tyr 


Met 


Tyr 


Lys 


Asp 


Ser 


Glu 


Val 


Arg 


Gly 


Leu 


Ala 


Val 


Thr 


Leu 


Ser 




50 










55 










60 










Val 


Lys 


Asp 


Ser 


Lys 


Met 


Ser 


Thr 


Leu 


Ser 


Cys 


Lys 


Asn 


Lys 


He 


He 
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65 70 75 80 

Ser Phe Glu Glu Met Asp Pro Pro Glu Asn lie Asp Asp lie Gin Ser 

85 90 95 

Asp Leu lie Phe Phe Gin Lys Arg Val Pro Gly His Asn Lys Met Glu 

100 105 110 

Phe Glu Ser Ser Leu Tyr Glu Gly His Phe Leu Ala Ser Gin Lys Glu 

115 120 125 

Asp Asp Ala Phe Lys Leu lie Leu Lys Lys Lys Asp Glu Asn Gly Asp 

130 135 140 

Lys Ser Val Met Phe Thr Leu Thr Asn Leu His Gin Ser 
145 150 155 
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